An Efficient Stream Mining Technique
نویسندگان
چکیده
Stream analysis is considered as a crucial component of strategic control over a broad variety of disciplines in business, science and engineering. Stream data is a sequence of observations collected over intervals of time. Each data stream describes a phenomenon. Analysis on Stream data includes discovering trends (or patterns) in a Stream sequence. In the last few years, data mining has emerged and been recognized as a new technology for data analysis. Data Mining is the process of discovering potentially valuable patterns, associations, trends, sequences and dependencies in data. Data mining techniques can discover information that many traditional business analysis and statistical techniques fail to deliver. In our study, we emphasis on the use of data mining techniques on data streams, where mining techniques and tools are used in an attempt to recognize, anticipate and learn the stream behavior with different directly related or looked unrelated factors. Targeted data are sequences of observations collected over intervals of time. Each sequence describes a phenomenon or a factor. Such factors could have either a direct or indirect impact on the stream data under study. Examples of factors with direct impact include the yearly budgets and expenditures, taxations, local stocks prices, unemployment rates, inflation rates, fallen angels, and rising odds for upgrades. Indirect factors could include any phenomena in the local or global environments, such as, global stocks prices, education expenditures, weather conditions, employment strategies, and medical services. Analysis on data includes discovering trends (or patterns) and association between sequences in order to generate non-trivial knowledge. In this paper, we propose a data mining technique to predict the dependency between factors that affect performance. The proposed technique consists of three phases: (a) for each data sequence that represents a chosen phenomenon, generate its trend sequences, (b) discover maximal frequent trend patterns, generate pattern vectors (to keep information of frequent trend patterns), use trend pattern vectors to predict future factor sequences.
منابع مشابه
Resource-Aware Very Fast K-Means for Ubiquitous Data Stream Mining
Developments in data streams, coupled with the growth in mobile and pervasive devices, have led to the emergence of Ubiquitous Data Mining (UDM). UDM aims to perform data stream mining in a ubiquitous environment with resourceconstrained and/or mobile devices. Over the past few years, stream mining techniques have attracted the attention of the data mining community. However these techniques ha...
متن کاملEfficient Classifier Generation over Stream Sliding Window using Associative Classification Approach
Prominence of data streams has dragged the interest of many researchers in the recent past. Mining associative rules generated on data streams for prediction has raised greater research interest in recent years. Associative classification mining has shown better performance over many former classification techniques in Data Mining and Data Stream Mining domains. This paper introduces a new tech...
متن کاملMining Disjunctive Sequential Patterns from News Stream
Frequent disjunctive pattern is known to be a sophisticated method of text mining in a single document that satisfies anti-monotonicity, by which we can discuss efficient algorithm based on APRIORI. In this work, we propose a new online and single-pass algorithm by which we can extract current frequent disjunctive patterns by a weighting method for past events from a news stream. And we discuss...
متن کاملOn Clustering Massive Data Streams: A Summarization Paradigm
In recent years, data streams have become ubiquitous because of the large number of applications which generate huge volumes of data in an automated way. Many existing data mining methods cannot be applied directly on data streams because of the fact that the data needs to be mined in one pass. Furthermore, data streams show a considerable amount of temporal locality because of which a direct a...
متن کاملFP-Viz: Visual Frequent Pattern Mining
Frequent pattern mining plays an essential role in many data analysis tasks including association-, correlation-, and causality analysis and has broad applications. Examples are market basket analysis and web click stream analysis. Although a number of efficient methods for mining frequent patterns where proposed in the past, there exist only a small number of visual exploration tools for disco...
متن کامل